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Regulat ry sequences of tbem use villiii gene - Use in transgenesis. 

The invention relates to regulatoiy sequences of the mouse villin gene that efficiently 
drive transgenic expression in immature and differentiated epitheha] cells of the 
intestine and uro-genital tracts. 

Villin is a cytoskeletal protein which is mainly produced in epithelial cells that develop 
a brush-border responsible for absorption as in the digestive apparatus (epithelial cells 
of the large and small intestine) and in the urogenital tract (epithelial cells of the kidney 
proximal tubules). Because it is expressed in the proliferative stem cells of the intestinal 
crypts (16,17), it is beheved to be an early marker for coxnmited intestinal cells. The 
multiple levels of regulation control villin gene activity during mouse embryogenesis 
(18-20) and account for the strict pattern of tissue-specific expression observed in 
adults. Moreover, the expression of the villin gene in intestinal epithelial cells is 
conspicuously maintained in their correspondant carcinomas (2 1 -24). 
The specific expression pattern of villin suggests that it is an appropriate candidate for 
the characterization of regulatory sequences that could allow targeting of heterologous 
genes into a selected population of cells in the mouse digestive tract 
In order to design new constructs and systems enabling the targeted expression of genes 
in epithelial cells of intestinal or urogenital tracts, the inventors have investigated the 
underlying molecular mechanisms and particularly those responsible for the restricted 
tissue specificity of the expression of villin. 

The invention therefore provides new regulatory sequences encompassing cis-acting 
elements involved in the regulation of the transcription and of the expression of the 
murine villin gene. 

The invention also relates to recombinant constructs comprising said regulatory 
sequences, for the control of the targeted expression of determined nucleic acid 
sequences so-called (heterologous sequences or also transgenes), in cells or tissues 
originating from the intestinal mucosa. 
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A further object f the invention is lo provide cells, tissues or organisms including 
animals, expressing said determined nucleic acid sequences in a targeted manner. 
Transgenic mice are routinely used to study the molecular and cellular basis of normal 
and pathological states in intestinal mucosa (1-5). The major limitation regarding the 
targeting of exogenous iransgenes in this tissue, is that the epithelium of the mouse 
intestinal mucosa is renewed every 2^5 days (6^8). The epithelial ceUs arise from 
multipotent stem cells functionally anchored at the base (more precisely in the lower 
third) of the epihelium's proUferative compartment, the crypts of Lieberkiihn. These 
crypts display a monoclonal organization since they are each derived from a single 
progenitor cell (9). Descendants of stem cells multiply in the middle portion of each 
crypt (10), and gradually differentiate into four principal cell types. In the small 
intestine, absorptive enterocytes (constituting >80% of the epithelial cells), mucus- 
producing globlet cells and enteroendocrine cells migrate upward from the crypts to the 
apex of surrounding villi (whose colonic counterparts are hexagonal shaped cuffs) (11), 
life where they become apoptotic and are exfoliated into the gut lumen (12). In contrast, 
J'' antimicrobial peptides-secreting Paneth cells migrate to the bottom of the crypts, where 
^ f they reside for about 20 days (13). 

Q Given the remarkable protective effect of this epithelium, it is not unexpected that most 
:^ previous smdies aiming to induce neoplastic transformation in intestinal mucosa of 
|30 transgenic mice have failed (14, 15). In these prior reports, the use of promoter 
sequences which direct oncogenes in non proliferating enterocytes located in the upper 
third of crypts produce only minor phenotypic abnormalities without tumorigenic 
consequences in the gut epithelium, suggesting that the residence time of these villus- 
associated cells may not be sufiGcient for the oncogenes to exert their effects. 
25 Furthemore this suggests that transgenic mouse models of neoplasia may require an 
efficient targeting of oncogenes in crypts stem cells or their immediate descendants. 
With this goal in mind, the human villin gene has been isolated and characterized (2S). 
A 2 kb 5 '-flanking region has been found to contain suf&cient regulatory elements to 
promote tissue-specific expression of a reponer gene in intestinal and renal cell lines 
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(26) In transgenic mice, this reguiai ly region is able to drive the expression of the 
human Ha-ras oncogene in the tissues in which the endogenous gene ,s acnvely 
ixanscribed. However low levels of expression were observed that did not tngger 
malignant tissue appearance into the gut of ihese animals. 

The invention provides new means for the targeted expression of heterologous 
sequences in cells originating from intestinal and/or urogenital tracts. Advantageously, 
the invention encompasses but is not limited to nucleotide sequences that should 
overcome at least in part, some of the deficiencies of the previously described gene 
constnicts prepared with regulatory sequences of the human villin gene : especially 
having recourse to regulatory elements promoting the expression of the murine villin 
gene, the inventors have designed new DNA constructs that may improve the efBciaicy 
of the targeted expression of heterologous genes in transgenic animals, with respect to 
the results obtained when said heterologous genes are placed under the control of the 
human villin regulatory elements contained in the above disclosed 2 kb sequence. 
The inventors have analysed an extended genomic region of the mouse villin gene with 
the goal of mapping elements localized at the 5' and/or 3' ends and possibly involved in 
promoting high levels of targeted expression of heterologous sequences in epithelial 
stem or differentiated cells, specifically originating from the intestinal mucosa. 
As a result, fee inventors have identified a transcripiional regulatory region that enables 
efficient targeted tissue-specific or cell-specific expression, reproducing the expression 
pattern of the endogenous villin gene in mice, including expression in the crypt stem 
cells of the colon and in differencitaied cells. 

In accordance with the invention the identified villin regulatory region provides a 
molecular tool for the establishment of new cell lines, including new immortalized cell 
lines, particularly epithelial cell lines originating from intestinal tissue. It also provides 
means appropriate for the preparation of transgenic organisms, especially transgenic 
animals, including transgenic mice, enabling the targeted expression of determined 
nucleotide sequences. 
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The invention therefore disci ses appropriate tools for the study of palholosical states 
related to a dysfimction of gene expression partem or for the treatment or prevention of 
said pathological states, for example to enable targeted expression of a gene acting as 
repairer gene in order to compensate for the dysfunction of an endogenous gene, or to 
add a novel function in cells or to suppress a detemiined fimction or state, or its 
consequences. 

The study of pathological states can benefit from the establishment of transgenic model 
animals wherein induction of a pathological state is obtained in relation with the 
targeted expression of genes. It also relates to the smdy of rescue of pathological states. 
The invention thus relates to a nucleotide sequence derived from ihe 5' sequence of the 
murine viUin gene, having a size of 9 kb on an agarose gel, or a fragment thereof, 
comprising nucleotide elements having a cis-regulatoiy activity that promotes the 
transcription of the murine villin gene. 
^ In accordance with the invention, the expression "nucleotide sequence" designates any 
y type of nucleotide sequence, especially DNA, whatever its origin, including genomic, 
, cloned, amplified, recombinant or synthetic sequences. 

!3 The teim "derived from" characterizes the fact that the sequence is defined with 

13 reference to the isolated endogenous sequence of the murine villin gene. Said nucleotide 
sequence derived from the murine villin gene nevertheless encompasses sequences 

^ obtained without having direct recourse to the native isolated gene. Furthermore, the 
structural and/or fimctiona] characterizing features of the nucleotide sequence may 
correspond to their native counterpart in the murine villin gene or may be modified as a 
result of mutation, deletion, truncation, or addition of nucleotides or nucleotide 
fragments provided its fimction of promoting the transcription and/or expression of the 

25 villin gene is not substaixtially affected 

The sizes of the nucleotide sequences which are indicated in the present paicnt 
application may vary: indeed, the indicated sizes correspond to the size deduced firom 
the band visualized on an agarose gel, in conditions corresponding to those given in the 
examples. 
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Z^d 15 orlO % of *e sequence, i. cnconrpas^ wite. .be pv« ^ ma..«K.n. 

^ucnc. of i.v..o» . . se^ence ^v^. a ci.-.^-.^ -v.. 
^ p,o»otcs .he .x^nscHption of *e n>.rine vilHn .hus enabling U,c .«.scnp..o„ 
i„ .he p,»du«i.n of mRNA and subsequen. expression of viUin pr.«,n. » 
Wesdne epi«.e.i.. cm. Condirion. appropriau .o US. said resuU.on. aC-ty « 
described in the examples. 

Whc. necessary, access » *e s-nuc DNA of tt.e n.»in= viUin gene ,s enab^ 
O^u-h various available Hb«ies Clones ob.ai.ed from a genomic library can mdecd 
be us'ed .o isola« a.e nuc.eo.ide se.^ences of invention, provided saad Cones 
^uin sequences comprising a *a^=n, having a si« of a. leas. 9 Icb upsaean. from 
tt,e .^.lation initiation codon. Said ftagmen. can be obrained fiom one clone or from 
overlapping genomic sequences courted in several clones. Probes derived from *e 
cDNA of tt.e villm gene can be used to identify appn=pria.e clones in a genomic hbrary 
,5 of murine genes. Especially, such probes can be designed around tire mmsla«on 
intiation codon a»i have advanrageously a. leas. 100 nucleotides, including sequences 
having 200 to 500 nucleotides, or more. 

such probes can also be designed starting from the sequence SEQ ID NO: 1 and can be 
synthesized or obtained by amplification. Tbey can be derived from the 5' sequence 
20 upstream of the translation initiation codon. 

According to specific embodiments, the invention relates to various nucleotide 
sequences derived from said 9 kb sequence. Especially, the invention relates to: 
. a nucleotide sequence which is the sequence extending 5.5 kb upstream and 3.5 kb 
downstream from the transcription initiation site of the murine villin gene ; 
25 - a nucleotide sequence which is identified as SEQ ID NO: 1 and which is represented 
on Figure 6; 

- a nucleotide sequence which comprises or which is the nucleotide fragment extending 
from the HS I to the HS IV Dnase I-hypersensiiive sites; 
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The DNAsel hypersensitive sites can be mapped on a genomic DNA corresponding to 
the mouse viUin gene with the probes described on figure 2. 

- a sequence which comprises or which is the nucleotide fragment extending from the 
HS IV Dnasel-hypersensitive site to the translation initiation codon of the murine villin 

5 gene; 

- a sequence which comprises or which is the nucleotide fragment extending from the 
nucleotide at around position -100 upstream from the transcription initiation site, to the 
translation initiation codon; 

- a sequence which comprises or which is the nucleotide fragment extending 3.5 kb 
10 upstream from the transcription initiation site to the transcription initiation site and 
13 further comprises the translation initiation codon, and possibly the sequence of exon 1 

and the sequence of exon 2 starting 5' from the ATG codon; 

- a sequence which comprises or which is the nucleotide fragment extending from 
around the nucleotide at position -480 from the transcription initiarion sequence, to the 
translation initiation codon; 

- a sequence which comprises or which is the sequence extending 3.5 kb upstream from 
the transcription initiation site to the translation initiation codon, provided the region 
corresponding to intron 1. located between said sites is deleted, or deleted in part; 

- a sequence which comprises or which is derived from the nucleotide sequence of the 
'20 murine villin gene having a size of 9 kb on an agarose gel and extending 3.5 kb 

upstream from the transoiption initiation site and 5.5 kb downstream, from said site, or a 
fragment thereot said nucleotide sequence or fragment thereof having a regulatory 
activity on the transcription of the murine villin gene and/or on the level of expression 
of the murine villin gene in intestine ceUs and/or in transgenic mice. 
25 The transcription regulatory activity of the murine villin gene is cited as a reference for 
the identification of nucleotide sequences encompassed within the scope of the 
invention. The invention fiiithennore relates to a nucleotide sequence which is derived 
from said 9 kb sequence of the murine villin gene and which enables the transcription of 
a heterologous sequence, with respect to this gene. 
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According to another embodiment, the invention relates to a sequence having a 
regulatory activity on the transcription of a villin gene, which sequence is a non-human 
sequence and which hybridizes in stringent conditions with at least one of the above 

nucleotide sequences. 

Especially variant nucleotide sequences can be obtained from other animals of the 
Vertebrates or Invertebrates groups, especially from mammals, or from birds especially 
chicken^or from fishes. Referring to Invertebrates nucleotide sequences can be otained 
from Drosophila or from C ek^ans- 

Although the above defined sequences have been shown as being able to promote 
efficient transcription and expression of heterologous sequences in epithelial cells of 
intestinal origin or in transgenic mice, their smicniral variations may affect the 
efficiency of their regulatory activity, either with respect to the tissue - or cell- 
specificity of this activity, or with respect lo the expression level observed. 
They can therefore be used to promote the targeted transcription and expression of 
genes (or more generally any nucleotide sequence of interest) in epitheUal cells of the 
intestinal or uro-genital tracts, either in stem cells or in differentiated cells. 
The invention thus relates to a recombinant nucleotide sequence which comprises a first 
nucleotide sequence and a second nucleotide sequence for which a tissue specific 
targeted expression in epithelial intestine cells is sought. 

In a particular embodiment of the invention, the second nucleotide sequence is a 
sequence encoding a deteraiined polypeptide, protein or peptide, all designated hereafter 
by the term "polypeptide". 

The second nucleotide sequence may also be a sequence of ther^eutic interest such as 
mmor suppressor gene, a functional inhibitor of a gene, an antisense sequence, an 
oncogene, an immortilizing gene, a normal gene for the restauiation of a function, or 
more generally any sequence the targeted expression of which in epithelial cells of 
intestinal origin may present an interest, for instance in processes for prevoiting, 
controlling or ctjring pathological states including those states related to the 
development of ttmiors. 
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In a particular embodiment of the invention, the second nucleotide sequence codes for 
an antigen or an iininun globulin or for fiagments thereof, including variable chains or 
immunoglobulins. 

In another embodiment of the invention, the second nucleotide sequence is an oncogene. 
5 A recombinant sequence of the invention comprising an oncogene may be used for 
studies relating to carcinogenesis especially in animal models expressing said 
recombinant sequence. 

According to a specific embodiment of the invention the second nucleotide sequence 
which is placed under the control of the regulatory sequences of murine villin gene, is 

10 further placed under the control of an inducible system, for example the 
Tetracycline/Doxycycline mediated temporal control of gene expression in transgenic 

1 mice (Kistner A- ei al, 1996, PNAS, 21> 10933-10938). 

r\ The invention also concerns vectors containing the nucleotide sequences of the 
invention especially plasmids or cosmids, appropriate for the transfer and/or expre^ion 
J§ of these sequences in cells or transgenic animals. 

'■^^ The invention also concerns recombinant epithelial cells comprising a recombinant 
O sequence as described above, 

Itl These epithelial cells encompass stem cells, especially crypt stem cells of the intestinal 

W mucosa, or differenciated cells, especially commited intestinal cells. 

^ The invention further relates to an epithelial cell originating from the kidney proximal 
tubules recombined with the nucleotide sequences of the inveniioiL 
The epithelial cells of the invention can be inunortalized cells, especially as a result of 
the expression of an immortalizing gene (e.g., AgT (lsA58)) the expression of which is 
drived and targeted by the regulatory nucleotide sequence of the invention. 

25 The administTation of the recombinant nucleotide sequence of the invention may be 
made by the available techniques including ex vivo or in vjvo administration processes, 
especially by electroporation, calcium phosphate precipitation, liposomes... 
The invention also concerns transgenic animals obtained by transgenesis of recombinant 
nucleotide sequences of the invention. 
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•n^ese transgenic animals encompass animals from the Vertebrates or Invertebrates 
groups and a« especially birds for instance chicken or fishes or non-human mammals. 
„,ore particularly mice. Invertebrates like Prosophila or Nematodes, like Q^degan^ can 
also be used for the preparation of transgenic animals expressing recombinant sequences 
5 of the invention under the control of regulatory sequences derived from their 
endogenous villin genes. Appropriate processes for the preparation of the transgenic 
animals expressing in a targeted tissue specific way, recombinant nucleotide sequences, 
are disclosed in the following examples. 

Such a process for the preparation of 1«nsgenic mice, advantageously comprises the 
]0 steps of: 

0 - administration of a transgene into the pronuclei of fertilized eggs of mice. 

1 - enabling the development of the recombined eggs to recover transgenic mice 
H (founders) and verifying the presence of the transgene. 

- if appropriate crossing the founders with non transgenic mice. 
% 5 Further steps, including crossing between mice capable of expressing the sequences of 
T the invention with mice expressing different sequences (e.g. sequences encoding 
9 inducible systems) in order to obtain double recombinant mice. 



1=^20 LEGENDS OF THE FIGURES 

Fi?HTg 1 •• 

Determination of tbe transcription stkrt site of the mouse villin gene by primer 
extension. A, primer extension analysis perfomied with mouse intestinal total RNA 
(30 Jig) and with either the end-labeled\ villin oUgonucleotide (generating a 1 OS- 
IS nuleotides extension product) or the end-l^eled mouse intestinal fatty acid binding 
protein gene (Fabpi) oligonucleotide used Jb a positive control (generating a 81- 
nucleotides extension product). The size of theVagments obtained by primer extension 
is shown at the left. The unrelated sequence laddter that was run in the same gel is used 
as a size marker. B, nucleotide sequence betweeAthe transcription start site (the bold 
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adenosine designated as -^1) and the Vitiation codon (the bold underlined ATG codon) 
of the mouse villin cDNA. Each of theWe junctions present in the intion 1 (indicated 
below) conforms to the consensus sp\ce donor (the italique GT nucleotides) and 
acceptor (the ital.que AG nucleotides) paWs, described by Breathnach and Chambon 
(30). C, schematic representation of the orgWzation of the 5' -flanking region of the 
murine villin gene. The open box lepresenis Ve untranslated exon and the shadowed 
box represents the first coding exon. The size o{\e exon and the intron is indicated. 

Pimire 2 : 

DNase I-hypersensitivity in the mouse villin gene. A, a partial restriction map 
diagram of the mouse villin gene regions subcloned (-3.5 to +9.9 kb in respect to the 
transcription start site, indicated by an arrowhead). BamHl (B), BgUl (Bg), Eco RI (E), 
Hind m (H) restriction sites, ATG initiation codon and the probes used to map the 
hypersensiiives sites (0.5, 0.8 and 1.25 kb) are shown. B and C, intestine, kidney, liver 
and spleen nuclei were digested with increasing amounts of DNase I at 0 °C for 10 min 
(0, 20, 40, 80, 160 units). 10 jig of purified genomic DNA was digested with BgiH 
(panel B), and BamHl (panel C), elecnophoresed and transferred to a nylon membrane. 
Hypersensitives sites were revealed by probing with a "P-labelled fi^gment of 0.5 kb. 
Positions of coelectrophoresed molecular weight markers are indicated at the left, and 
the hypersensitive bands are mariced by arrows at the right. The maps r^resenied below 
show the position of restriction sites, the deduced DNase I-hypersensitive sites 
(indicated by arrows) and the 0.5 kb probe used. 



Eigur6 3j 

Transient transfectioD analysis of the mouse villin promoter. A, above a partial 
restriction map diagram of the mouse villin gene from 9 kb with respect to the 
translation initiation codon. Apal (A), Bamm (B). BglU (Bg), BstEU (Bs), Drdl (D), 
Ncol (N) and Xbal (X) restriction sites are shown. The schematic represcntaiion below 
shows the location of the four hypersensitive sites (I-IV) as well as the 5.5 kb intron 
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(represented by a cross-hatched rectangle) separ:itmg the transcription start «te 
(indicated by an arrowhead) and the translation initiation c don. B, diagranis of the 
various constructs generated by deletion. Different portions of the 5'-flanking region of 
the mouse villin gene were fused with the E. coli p-galactosidase gene containing the 
nuclear localization signal (nls). C. ^gaiactosidase activities resulting fiom transient 
tr^fections into CaCo2 colon cells (shadowed bars) or LLCPKl kidney cells (open 
bars) with the reporter constructs generated (represented in panel B). Basal activity 
resulting fiom the promoterless pBasic plasmid was set arbitrarily at 1. Values indicate 
the average of at least three independent transfections. 



Figur?'^ '■ 

Expression pattern of the transgene. Transgene (p^) specific-transcripts were 
detected by reverse transcription-PCR in a ethidium bromide containing agarose gel- 
Above each lane, the different tissues tested and the controls, (+) : kidney mRNA from a 
^^5 mouse in which the |3-galactosidase was inserted at the villin locus (32), and (-) : 
distilled HjO as a template. RT-PCR were also performed on mRNAs of the 
endogenous villin gene and the ubiquitous TFIID gene. 

. I i 

i ^ 

^=^0 p-galactosidase activity in sections of smaU intestine and colon from transgenic 
mice. Tissues were removed from transgenic mice, fixed, and stained for p- 
galactosidase activity with X-Gal as described, p-galactosidase activity was observed in 
the epithelial cells, both immanire and differentiated, along the crypt-viUus axis in the 
small intestine (panel A). Note that the differentiated cells exhibited a strong signal as 
25 did the villus-associated cells and the Paneth cells (arrows) localized to the bottom of 
the ctypt (panel B). The epithelial cells of the colon were also stained (panel Q, 
particularly all the cells in the crypt (panel D). Bars, 100 yaa. (panels A and C), 40 pm 
(panels B and D). 
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Sequence of the genomic DNA of theWurine villin gene comprising cis-acimg elements 
capable to promote the trai^ption oW murine villin gene in intestinal mucosa and 
kidney proximal tubules. The sequenc^comprises the transcription initiation site ai 
position 3442 followed by the sequence \exon 1 contaiidng 46 pb. the translation 
initiation codon at position 8993, the sequeA^e of intron 1 extending from nucleotide 
3488 to nucleotide 8981. 
Tism 1 ' 

Targeted expression of the p-galactosidase protein using regulatory sequences of the 
mouse villin gene. The data have been obtained by transgenesis. 

T^i^re 8 : 

Protocols for the preparation of transgenic mice expressing determined nucleotide 
sequences under the control of the murine villin gene regulatory sequences. 

EXPERIMENTAL PROCEDURES 

A large genomic region of the mouse villin gene has been analyzed. A 9 kb regulatory 
region of the mouse villin gene (haibouring 3.5 kb upstream the transcription start site 
and 5.5 kb of the first intron) was able to promote transcription of the Lac Z reporter 
gene in small and large intestines of transgenic mice, in a transmissible manner, and 
thus efficiently directed subsequent p-galactosidase expression in epithelial cells along 
the entire crypt-villus axis. In Ac kidney, the transgene was also expressed in the 
epithelial cells of the proximal tubules but is likely sensitive to the site of integration. A 
construct lacking the first intron restricted p-galactosidase expression to the small 
intestine. Thus, the 9 kb genomic region contains the necessary cis-acting elements to 
recapitulate the tissue-specific expression pattern of the endogenous villin gene. Hooce, 
these regulatory sequences can be used to target heterologous genes in immature and 
differentiated epithelial cells of the small and/or large intestinal mucosa. 
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Here we report the analysis of tissue-specific expression of the mouse villin gene using : 
(i) DNasc I-hypereensitive sites assays, (ii) transient-transfection assays and (iii) 
transgenic mice. 

Cell culture and ex vivo transient transfection. 
5 Human colon carcinoma CaCo2 cells were cultured at 37°C, 10% in C02, in Dulbecco 
modified Eagle medium supplemented with 10% fetal scrum, IX nonessential amino- 
acids and 5 mM L-glutamine. Pig Kidney proximal tubules derived-LLCPKl cells and 
canine kidney distal tubules derived-MDCK cells were cultured at 37°C, 10% C02, in 
Dulbecco modified Eagle medium supplemented with 10% fetal calf serum and S mM 
10 L-glutamine. Cells cultures, approximately 50% confluent in 60 mm-dishes containing 
^ senrm-free medium, wre cotrOTsfected using 15 ^1 of Lipofectin reagent (Life 
Technology es> Inc.) with 5 ^ of each p-galactosidase reporter plasmid construct and 5 
fig of the control plasmid, pRSVLuc, which contains the luciferase gene under the 
"2 control of the Rous sarcoma virus promoter. The serum- fiw mediimi was changed to 
lis growth culture medium 6 h after transfection, and cells were harvested 48 h later. Cell 
extracts were assayed by chemilimiinescent detection of both ^galactosidase (Galacto- 
P Light, Tropix, Inc.) and luciferase (Luciferase Assay Kit, Tropix, Inc.) activities using a 

in 

lununometer (Berthold). |3-galactosidase activity (light units) was corrected for 

?y variations in transfection efficiencies as determined by luciferase activity. The volume 

O 

p£) of cell extracts used in the p-galactosidase and luciferase assays wre adjusted such taht 
the enzyme activity was always within the linear range of the assay. All transfections 
were repeated at least three times. Results are expressed as -fold induction over that of 
the vector without promoter* pBasic. 

25 Primer extension analysis. 

Total RNA was isolated from mouse intestine wMi RNA NOW reagent (Biogentex) 
under the conditions suggested by the supplier. For pttmer extension assay, 2 ng of "P- 
labeled oligonucleotide probe (S'-GAGTGqTGATGTTGAGAGAGCCT-B') 
^ complementary to nucleotides ^Sl to +103 of the irtprine villin cDNA (GenBank 




wo 00/34492 ^Hl PCT/EP98/08009 

14 




Accession No. M98454) was hybtadized with 30 of total RNA at 60^C (0J25 M KCl, 
10 inM Tris-HCl, pH 7.5, 1 uiiAeDTA) for 90 rain. Transcription with 5 U/fil of 
Moloney murine leukemia virus iwerse transcriptase (Life Technologies, Inc.) was 
carried out at 37°C for 90 min in a 300 |J of a solution containing 75 mM KCl, 3 mM 
MgCl^, 50 mM Tris-HCl (pH S.3), IQ mM dithiothreitol, 0.75 mM deoxynucleoside 
triphosophates, 75 p.g/ml aciinomycin Q and 0.3 U/^iI RNasin. The primer extension 
products were separated by electrophoresi\ in denaturing 8% polyacrylamide gels. The 
full-length extension product (105 nucleotiflies) was obtained by comparison with the 
length of the comigraling sequencing reacdm products, A primer extension control 
experiment was performed on the same total XRNA preparation^ using a "'^P-labeled 
ohgonucleotide probe (5'-CATAGTTCTCGfecCGGT-3') complementary to 
nucleotides +63 to 4-80 of the mouse intestinal fhty acid binding protein (I-FABP) 
cDNA and generating a 81 -nucleotide extension produ\t (27). 

DNase I-bypersensitive sites analysis. 

Tissues from 30 mice were used per assay of intestine, kidney, liver and spleen. Nuclei 
preparation and DNase I digestion were perforaied as described (28) with minor 
modifications. Nuclei were digested withoui or with 20 to 160 units of DNase I (DPRF 
Worthington) for 10 min ai 0°C, Genomic DNA was purified by three rounds of (1:1) 
phenol-chloroform extraction followed by chloroforai extraction and precipitation with 
ethanoL 10 jig of each sample was digested overnight with restriction enzyme (BamiU 
oi BglU), The DNA fragments were separated by elcctjophoresis on a 0.8% agarose gel 
in TAE (40 mM Tris (pH 7.2), 20 mM sodium acetate, 1 mM EDTA), transferred onto a 
charged nylon membrane (Hybond-N^, Amersham), and hybridized at 65°C overnight 
with a random-primed (rediprime, Amersham) ^^P-labeled probe. The probe, the BgUl- 
Pstl probe (0,5 kb) (as indicated in Fig. 2) was used to map the DNase I-hypCTsensitive 
sites in the BamHI, BgUI fragments. The filter was washed using (IX SSC (0.15 M 
NaCL, 15mM sodium citrate, pH 7.0), 0.1% SDS, 55°C) and exposed to fdm ovemight 
at -70**C with an intensifying screen. 
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Plasmids construction. 

All constructs describedWere subcloned into the pBluescript II KS vector (Stratagene) 
with fragments isolated from a aDASHU phage containing a 16.3 kb region (9 kb 
upstream and 73 kb downWeam from the translation initiation codon) of the mouse 
villin gene (29). The pDl clonstruct (as described in the Fig. 3B) was prepared by 
ligating a BamUJ fragment ok 5 J kb (L8 kb upstream from the ATG translation 
initiation codon of the mouse vilfb gene, subcloned 5' to the nuclear localization signal- 
p-galactosidase gene-SV40 polyaldenylation site, using a polymerase chain reaction 
(PGR) strategy) at the BamHl siteXin a plasmid containing the 3,7 kb region of the 
mouse villin gene (immediately 5* tdthe 1-8 kb region described above). The pAl and 
pA2 (containing an internal 1 kb deletion) constructs have resulted from several steps 
based on the sites present in the kb region described above and in a plasmid 

containing the 3.5 kb region of the mopse villin gene (immediately 5' to the 3.7 kb 
region). The pCI and pC2 constructs wereWrived from the pAl and pA2 plasmids cut 
with Apal and re-ljgated, respectively. To generate the pBl construct, a Bgin fragment 
(480 bp) from the 3^ kb region described abd^e was excised and cloned into the Kpnl 
site of the pCl plasmid. The pA3, pB3 and pCi^onstructs correspond to the pAl, pBl 
and pCl deleted from the intron 1 (Fig. 3B). THb sequence between the transcription 
initiation start site and the translation initiation cbdon, excluding the intron 1^ was 
deduced from that of the murine villin cDNA (GenQa^ Accession No. M98454) and 
was introduced into the Bgm-Ncdl sites of the pCl Construct by using a dimerized 
oligodimer made of a coding-stranA oligonucleotide (5'- 
GATCTCCCAGGTGGTGGCTGCCTCTTCCAGACAGqCTCGTCCAC.3') and anon 
coding-strand oligonucleotide \ (5'. 

CATGGTGGACGAGCCTGTCTGGAAGAGGCAGCCACG^CCTGGGA-30, 
resulting in the pB3 construct. The pA3 and the pC3 construdts were derived from the 
pB3 plasmid by ligating an Apal fragment (3.1 kb) and a ^^/uVagment (480 bp) from 
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the 3-5 kb region described above, at the Apal siie in the pB3 plasmid respectively. 



»0^t Subcloning steps were confirmed by DNA sequencing. 
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Traosgevlic mice geaeration. 

The transgenes digested with Xhol-NoiU purified by ge! elution and Elutip Columns 
(Schleiche & Schuell). The linear fragments were supended in 10 mM Tris-HCl, pH 7.4, 
0.2 mM EDTA and were injected into the pronuclei of the fertilized eggs of the B6/D2 
mice. Mice carrying transgenes (founders) were first identified by PCR of genomic 
DNA isolated from a short segment of tail to confirm the presence of the p-galactosidase 
gene and then analyzed by Southern blotting to determine the copy number of the 
integrated transgene. Each founder animal harbored one copy of the transgene per 
genome. Small intestine, colon, kidney, stomach, liver, heart, lung, thymus, brain, 
spleen and muscle were dissected from transgenic mice* cut in small pieces, quickly 
frozen in liquid nitrogen-cooled isopentane either prepared for total KNA extraction or 
embedded in Tissue-Tek O.CT. Compound (Sakura Finetek) blocks to perform 
cryosections. 

Reverse transcription-PCR Analysis. 
Total RNA was isolated from mouse tissueis described above, with SV Total KNA 
Isolation System (Promega) under the condiiidns suggested by the supplier. 20 ng of 
pd(N)^ random primer (Phamiacia) were hybridized with 2 jig of total RNA at 70**C for 
10 min in distilled watex. Reverse transcription witlA200 U of Moloney murine leukemia 
virus reverse transcriptase (Superscript II, Life Tectaologies, Inc) was carried out at 
37°C for 90 min in a 20 >J solution of IX First Strand Birffer (Life Technologies, Inc), 10 
mM dithiothreitol, 0.5 mM deoxynucleoside triphosphatae and 0.4 U/^l RNasiiL 2 fJ of 
the resulting cDNAs, were amplified by PCR reaction in 5^\i\ for 40 cycles. Each cycle 
consisted of 60 sec at 94^C, 60 sec at 5 rC (for transgenXand villin) and 57'C (for 
TFIID), and 30 sec at 72»C. For the ti^gene primers, 5'- 
CAACTTCCTAAGATCTCC-3' coding stTand\ and 5'- 
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ATTCAGGCTGCGCAACTGTT-3' non-doding strand were used, generating a 250 bp 
product. For villin amplification 5'-CAACTTCCTAAGATCTCC-3' coding strand 
primer and 5'-GCAACAGTCGCTGGACAT{CACAGG-3' non-coding strand primers 
were used, generating a 473 bp prod^t; for J Fill) amplification 5 - 
CCACGGACAACTGCGTTGAT-3* coding \ strand primer and 5'- 
GGCTCATAGCTACTGAACTG-3' non-coding strand primer were used, generating a 
220 bp product. In all cases, one-fifth of the PGR pKuluct was run on an ethidium 
bromide containing agarose gel. \ 

Detection of ^-gaiacto$idase activity. 

Cryosections (5 pm) fix)m the tissues described above wre then dried overnight at room 
temperamre, fixed with 3% paraformaldehyde for 5 min, washed in phosphate buffered 
saline and incubated in a staining solution thai contained 0.4 mg of 5-bromo-4-chlorD-3- 
indolyl-p-D-galactopyranoside (X-Gal) per ml, 4 mM potassium feiricyanide, 4 mM 
potassium ferrocyanide, 2 mM MgCl^ at 37*C for 8 h. After staining, the sections were 
again washed in phosphate buffered saline, mounted and examined histologically to 
detect the expression of exogenous p-galactosidase. 

RESULTS 

Determination of the transcription start site. 

To determine the transcriptional start site of the mouse villin gene, total RNA was 
isolated fiiom intestine and analysed by primer extension assay using an oligonucleotide 
complementary to the mouse viUin cDNA downsneam of the ATG translational start 
site. The efficiency of the reaction was confirmed by primer extension of the mouse 
intestinal fatty acid binding protein a-FABP) gene (fabpi) from the same RNA 
preparation (27). Analysis of the fabpi extension product on a sequencing gel by 
comparison with a sequence ladder (Fig. 1 A) revealed a strong signal band of a size of 
81 bp as expected. The extension product of villin was 105 pb indicating that the 
transcriptional start site (an adenosine residue subsequently designed as nucleotide +1) 
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was 57 nucleotides upstream of the translati n initiaiion codoj) of the murine villin 
cDNA (Fig. IB). Coizq>arison of the genomic sequence encompassing 9 kb upstream 
from the ATG initiation codon with the cDNA sequence, position of splice site 
consensus sequences in the 9 kb genomic sequence (Fig, IB) and detentiinaiion of the 
transcription start site reveal that the mouse villin gene has one transcription start site 
which is separated from the ATG iniation codon by a 5,5 kb intronic region (Fig,lC). 

DNase I-hypersensitive sites in the mouse viUin gene. 

To characterize the key regulatory regions involved in the specific control of villin 
expression, we have m^ed the DNase I-hypersensitive sites (31) in the mouse villin 
gene (along a region extending 9 kb upstream and 4.4 kb downsn-eam from the 
translation initiation codon, as represented in Fig. 2A). The chromatin form of the 
mouse villin gene in dififereni tissues (intestine, kidney, liver and spleen) was submitted 
to limited DNase I digestion and subsequently digested with the appropriate restriction 
enzymes. Accordingly, nuclei were isolated from intestine, kidney, hver and spleen. The 
Dnasc I-digesied DNA was restricted by Blgll and hybridized with a 0.5 kb probe 
homologous to the 5* of the 7,5 kb BlgU fragment (Fig. 2B). Two sets of Dnase I 
incubation-related fragments were detected, migrating at 5,5 and 2,7 kb, and 
corresponding to hypersensitive sites designated as HS I (located at approximately +5,5 
kb downstream from the transcription start (+1) site, just upstream the ATG initiation 
codon) and HS n (located at approximately +3 kb downstream from the (+1) site), 
respectively. HS I was observed in nuclei isolated from intestine, kidney and liver, 
whereas HS H was only present in intestinal tissue. No specific hypersensitive siies 
were detected in nucleic isolated from spleen. The presence and location of these 
hypersensitives sites were detected in nuclei isolated from spleen. The presence and 
location of these hypersensitives bands were confirmed by stripping and rehybridizing 
the same blot with the 0.8 kb probe (Fig. 2A) homologous to the other end of the 7.5 kb 
Bgin fragment (data not shown). Using BamHI digestion and the 0.5 kb probe (Fig. 
2C). five sets of DNase 1-treated nuclei-related fragments were detected, migrating at 
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3.4, 4 J, 4.7 and proximately 10 and 15 kb, corresponding to the hypersensitive sites 
HS n (previously identified and consequently confirmed), HS III (located at 
approximately -0.5 kb upstream from the (+1) site), HS IV (located at approximately -1 
kb upstream the (+1) site), HS V (located at proximately -10 kb upsrream fitjm the 
(+1) site) and HS VI (located at ^proximately -15 kb upstream fiom ihe (+1) site), 
respectively. HS III was observed in nuclei isolated fiiom both intestine and kidney, 
whereas HS IV was only present in intestinal tissue as HS 11. The hypersensitive site HS 
V and HS VI were only preseni in liver tissue (in which viUin is weakly expressed) and 
were located far upstream from ihe transcription start site in regions (i) which have not 
been subcloned and (ii) which could belong to an adjacent gene; for these reasons, these 
hepatic-specific hypersensitives sites were not analyzed fiirrher. As for BglU digestion, 
no specific hypersensitive sites were detected in nuclei isolated fiom spleen. Using other 
independeni restriction digestions (EcoRI and Hindlll) and the 0.5, 0,8 and 1.25 kb 
probes (Fig, 2A) to map the locations of the hypersensitive sites, similar results were 
obtained (data not shown). 

In conclusion, four major distinct DNase I-hypersensitives sites (HS I to HS IV) were 
shown to be present in the region extending from -1 kb to +5,5 kb in respect to the 
n^scription start site (Fig. 3A) of the mouse viliin gene. These sites were detected in 
intestine (HS I to HS IV), kidney (HS I and HS III) and liver (HS I), tissues in which 
villin is expressed, but they were not found in spleen, a tissue that does not produce 
viliin. These findings correlate with the tissue-specific control of villin gene expression, 
and suggest that the putative critical regulatory elements lie within ihese regions. HS II 
and HS IV were only detected in intestine and are probably associated with tissue- 
specific transcription factors binding sites involved in the positive control of villin gene 
intestinal expression. 

Analysis of promoter activity by transient expression. 

To test the effects of the segments containing the DNase I-hypersensitives sites (Fig. 
3A) on transcriptional activity and to define more precisely the element(s) controlling 
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villin gene expression in the intestme, segments were subcloned upstream of a 
promoterless Lac Z plasmid (coding for the bacterial (J-galactosidase gene with a 
nuclear localization signal sequence) (Fig. 3B). The resulting recombinant plasmids 
were tested by transient transfection assays in cultures cell lines. The construct pAl 
contained all the subcloned regions dowstream from the ATG initiation codon, 
encompassing the four DNase I-hypersensitive sites (HS I to HS IV) described above 
and the 5,5 kb intronic sequence, intion 1 . Plasmids pA2 and pA3 were identical to pAI 
except for the presence of intestine-specific hypersoisitive site HS II and intron 1, 
respectively. Plasmid pBl and plasmid pCI were similar to plasmid pAl, but lacked the 
regions extending from -480 bp to -3.5kb and -100 bp to -3.5 kb according to the 
transcription start site, respectively. Plasmid pC2 was identical to pA2, but lacked the 
region extending from -100 bp to -3.5 kb. Plasmids pB3 and pC3 were identical to pBl 
and pCl except for the presence of intron 1, respectively. The plasmid pDl was 
identical to pAl except for the presence of the transcription start site and the region 
extending upstream from this site. The plasmid pBasic, which does not contain a 
promoter or errhancer, and a pControl plasmid which possesses the SV40 promoter, 
were also tested in each experiment- Transient transfections were perforaied in the 
human colon enterocyte-like CaCo2 ceil line and the pig kidney proximal tubules- 
derived LLCPKl cell line which express villin, and in kidney epithehal cells in which 
no villin expression is detected, MDCK (a canine kidney distal tubules-derived cell 
line). Transcription from the villin promoter was measiu'ed by assajdng ^-galactosidase 
activity in extracts made from the transfected cells, and the results were expressed as - 
fold induction over that of the promoterless vector, pBasic (Fig. 3C). High levels of 
galaciosidase activity in the pControl transfected cell lines (CaCo2 cells, 50*fold over 
that of pBasic; LLCPKl cells, 98-fold) demonstrated the presence of efficient general 
transcriptionAranslation machineries in these cells. Very low levels of p-galactosidase 
activity in pDl both transfected cells compared to pBasic transfected cells showed that 
the uanscription start site was necessary for an efficient specific transcription of the 
reporter gene and that nonspecific transcription was not initiated elsewhere in the villin 
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regulatory sequences. The construct pAl expressed the p-galactosidase gene at the 
highest level in CaCo2 cells (8-fold overpBasic) as compared with LLCPKl cells (L5- 
fold over pBasic) suggesting thai the four DNase I-hypersensitives sites together with 
the first intron are necessary to promote efficiently transcription in cells of intestinal 
origin. Deletion of the firagment containing the intestinal-specific hypersensitive site HS 
11 (pA2) dramatically decreased p-galactosidase expression in CaCo2 cells (2-fold over 
pBasic) to about 25% of that of pAl, demonstrating that a major element which confers 
intestinal activity was confined within this fi:agment. Similar results were obtained when 
the region upstream fi^om the transcription start site (encompassing HS III and HS IV) 
was almost wholly deleted with or without HS n (pCl and pC2, respectively). The 
deletion of the intronic region alone (pA3), or in combination with deleted sequences 
upstream from the transcription start site (pB3 and pC3 extends only firom -480 and - 
100 bp, respectively), affected to a lesser extent p-galactosidase expression in the same 
intestinal cells (5.5-fold overpBasic), with a decrease to only about 65% of that of pAl> 
demonstrating that the regulatory elements which lay within 100 bp were sufficient to 
promote transcription in cultured cells. However the level of (3-galactosidase activity 
increased strongly when the plasmids pA3, pB3 and pC3 were transfected in LLCPKl 
cells (10, 44, and 45-fold over pBasic, respectively) showing that the absence of the first 
intron, in combination with the lack of intestine-specific HS IV, was able to promote 
transcription in a kidney cell line. This would suggest that negative elements which 
confer repression in kidney transcription are confined in these elements. 
To test specificity, the villin promoter-related constructs were transfected in MDCK 
cells, which do not express villin. After transfecdon, these cells showed only base-line 
levels of p-galactosidase activity when compared to pBasic-related activity (data not 
shown), demonstrating that the villin regulatory sequences were unable to promote 
efficient transcription in nonexpressing villin cells, and that consequently the e>cpress]an 
of the reporter gene in CaCo2 and LLCPKl cells is specifically dependent upon these 
regulaioiy sequences. Taken together, these results from transient n:ansfection of 
cultured cells demonstrate thai (i) the mouse villin genomic sequence, extending from - 
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3.5 to -?-5.5 kb, directs specifically an efficient level expression of the ^galactosidase 
reporter gene in intestine-derived cells, (ii) this level is dramaticaly reduced when the 
intronic intestine-specific hypersensitive site HS H or the region upstream fi-om the (+1) 
site is deleted, (iii) lack of the entire first intron seems to partially restore the intestine- 
related ability in promoting transcription, and (iv) lack of the entire first intron in 
combination with intestine-specific hypersensitive site HS IV is correlated with a strong 
increase of ability in promoting transcription in kidney-<ierived cells. 

Analysis of transgenes expression in mice : 

Since the -3.5 to +5.5 kb region of the mouse villin contained the enterocyte-Kke- 
specific promoter/enhancer activity in transient-transfection assays, we examined the 
ability of this region to drive intestine-specific expression of the 3-galactosidase reporter 
gene in transgenic mice. The construct pAl was then prepare, after excision of the 
plasmid sequence, and injected into fertilized eggs. Five foimder animals which 
contained the pAl constmct as a transgene were obtained. The founder mice were 
analysed for mRNA rqporter gene expression in several adult tissues by reverse- 
transcription PGR (RT-PCR) analysis. From the same cDNA samples, products 
encoding p-galactosidase, villin and TFIID were analyzed. The PGR assays enabled 
only the detection of spliced transcribed mRNA, excluding that fiDm genomic DNA 
itself, by means of an exon-connection strategy by combination of a 5' PGR primer fiom 
within the mouse villin promoter sequence just upstream of the splice donor site, and 
the 3' primers &om within the p-galactosidase gene or the villin gene. For each founder, 
no n^rter gene expression was detected in the tissues in which villin mRNAs were not 
detected using the PGR assay (Fig. 4). For all founder mice, the reporter gene 
transcription was detected along the cephalocaudal axis of the gut (duodenum, jejunum, 
ilemru proximal and distal colon) following the intestine-specific expression of fiie villin 
gene (Fig. 4). In the kidney, the transgene was only transcribed in one founder of five 
animals obtained (Fig. 4). TFIID mRNA was present in all samples fi-om tissues in 
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which the reporter gene expression could not be delected (Fig. 4), confirming the quality 
of RNA from these tissues. 

To examine the precise celluiar distribution of transgene expression within the tissues, 
ciyostat sections of small intestine, colon and kidney were prepared and subsequently 
stained for (3-galactosidase enzyme activity. Inununofluorescence analysis of P- 
galactosidase expression was also performed on the same sections and similar results 
were obtained with the two procedures. Sections of small intestine, colon and kidney 
from non-transgenic animals exhibited no detectable ^-galactosidase activity. For four 
of five nransgenic mice, a beterologotis pattern of expression in small intestine and colon 
was observed in this assay. This heterogeneity was due to mosaicism since we examined 
founder animals. The expression was confined to the nucleus of the epithelial cells, as 
expected because the P-gaiactosidase gene contains a nuclear localization sequence 
signal (Fig- 5). The staining was detected by a stronger signal in the villi migrating cells 
when compared with the crypts cells, of both small intestine (Fig. 5 A) and colon (Fig. 
5C) epithelium, thus confirming that the -3.5 to +5.5 kb region of the mouse vilHn gene 
is able to recapitulate precisely the cellular panem of expression, along the crypt-villus 
differentiation axis, of the endogenous villin gene (17). A continuous labelling of all 
cells of the crypt (Fig. 5B and D) was observed, suggesting the expression of the 
transgene in the stem cells (10). It is noted worthy that the intensity of the 
galactosidase staining was suniiar to that of intestinal sections fix>m chimeric animals 
Owbich possess a p-galactosidase gene integrated at the villin locus by homologous 
recombination procedure (32), indicating that the -3.5 to +5.5 kb region of the mouse 
villin gene was able to promote intestinal transcription as cfificiently as the mouse villin 
gene itself In the kidney of the founder mouse in which the transgene was detected by 
RT'PC3l, the staining was only observed in the epithelial cells of the proximal tubules 
where the villin gene is expressed. The founder animals were able to transmit the 
transgene to their oflfepring with a shnilar pattern of {^galactosidase expression. In our 
atiempt to direct an efficient expression of the reporter gene in the mtestinal epithelium 
with shorter regulatory sequences, plasmids pA3, pB3 and pC3 were used to generate 
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transgenic mice, because these constructs display efiBcient levels of p-galactosidase 
activity in intestine-derived CaCo2 cells. The presence of the transgene assessed by 
galactosidase staining and immunofluorescence procedures was observed in three of the 
four independent lines of pA3 transgenic mice generated. These three lines expressed 
5 the reporter gene only in the small intestine (in both the immature and differentiated 
epithelial cells along the cryptA^illus axis), and all three lines failed to express the 
transgene in the other tissues tested, particularly note worthy is the lack of expression in 
the colon and the kidney (data not shown)* These results demonstrate that (i) the 3.5 kb 
regulatory region upstream the transcription start site of the mouse villin gene is 
10 necessary and suCRcient to sustain expression strictly in small intestine of transgenic 
mice, (ii) the first intron of the mouse vilhn gene is required for colon and kidney 
expression in transgenic mice. Concerning the pB3 and pC3 transgenic mice, no 
transgene expression was observed in all tissues examined, including small intestine, 
colon and kidney. Thxis, the key cis-acting elements of the villin gene required for 
fi5 intestinal and/or kidney-related expression of transgene(s) in transgenic mice are not 

in 

located only within the region encompassing -480 bp upstream fix>m the transcription 
start site, as observed in the cultured epithelial cells. 

]^ DISCUSSION 

120 In this report, we demonstrate that cis-acting sequences located within a 9kb region 
(-3.5 to 4-5.5 kb firom the start site of transcription) of the mouse villin gene are 
sufficient to direct both correct tissue-specific and high expression level of the ^ 
galactosidase reporter gene in transgenic mice, when compared with the endogenous 
gene (19), Rq>orter gene expression is detected in the whole intestinal tube and 

25 appropriately restricted to epithelial cells along the crypt-viUus axis of both small 
intestine and colon, hi addition, these regulatory elements can maintain a gradient of ^ 
galactosidase gene expression from the crypts of LieberkOhn to tips of villi that 
precisely reproduce the gradient exhibited by the murine viUin gene (17). Similarities 
between transgene and endogenous gene expression were also noticed as judged by a 
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comparison with the staming intensity of p^galactosidase activity in intestinal sections 
from our transgenic mice and mice in which the reporter gene has been inserted at the 
natural villin locus by homologous recombination (32). 

In the kidney, for only one animal of five analyzed, mouse reporter gene expression was 
restricted to epithelial cells of the proximal tubules recapitulating the villin expression 
pattern in this tissue. This suggests that transcriptional mechanisms specifying gene 
expression to intestine and kidney tissues are in the -3,5 to -^5.5 kb region of the mouse 
villin gene, and that those related to kidney may be sensitive to positional effects. 
Indeed it is known that the transgene expression is dependent on site of chromosomal 
integration, and can be influenced by regulatory regions in the vicinity, presumably 
acting on chromatin conformation (33). The construct lacking entirely the first intron of 
5.5 kb, but which harboures 3.5 kb 5' to the start site of transcription of the mouse villin 
gene, placed in front of the p-galacosidase gene, restricts the w vivo expression of the 
reporter, gene only into the epithelial cells along the crypt-villus axis of the small 
intestine- The extinction of the reporter gene expression in the kidney might be due to 
strong positional effects, as rqsorted above, whereas the extinction related to the colon 
might be due to the absence of regulatory elements of the intron 1, such as the intestine- 
specific DNase I-hyperscnsitive site HS II. Constructs harbouring only the first 480 bp 
and 100 bp 5' to the start site of transcription, in combination with the lack of the first 
intron, placed in front of the p-galacosidase gene, both failed to drive intestine-specific 
and kidney-specific expression of p-galactosidase, suggesting that the intestine-specific 
DNase 1-hypersensitive site HS IV localized just upstream from the 480 bp might play 
an important role in promoting reporter gene expression into the epithelial cells of the 
small intestine. Thus, distinct and separable regulatory elements in the motise villin 
gene may direct transgene expression along the cephalocaudal axis of the gut: the 
regulatory elements required for transgene expression in the small intestine might be 
localized in the 3,5 kb region (i.e, the HS IV site) upstream ftx>m the transcription start 
site, whereas those necessary for the colonic expression might be localized in the first 
intron (i.e. the HS II site). The inabihty of shorter regulatory sequences of the mouse 
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viUin gene to direct correct expression of the reporter gene in the whole intestine of 
transgenic mice might also be explained by spatial reairangement of chromatiiie 
structure due to the lack of the entire first intron. In fact, the results described here are 
reminiscent of those of the adenosine deaminase gene (34) and the aldolase Bgene (35) 
5 in which elements located in the first intron could be required for transgene expression 
in vivOy because they may contain cis-acting tissue-specific enhancer elements and/or 
elements involved in promoting decondensation of the chromatin structure, allowing the 
accessibiliy for transcription factors and RNA polymerase. 

To explain the discrepancy seen in the abihty of the mouse villin gene regulatory 
10 elements to promote transcription of the reporter gene in cell cultures versus transgenic 
Q animals, we may argue that the regulation of gene expression in the intestinal epithelium 

occiu^ as cells differentiate and migrate along the crypt-villus axis. This process 
\| depends on the contacts that these cells maintain with others neighboring cells on the 
j4 one hand^ and with the extracellular matrix on the other hand (36). Thus, an ex vivo 
\§5 system as the intestine-derived CaCo2 cell line used in the study, is limited by its weak 
f ability in rec^itulating ttie temporal and spatial complexities of this epithelium and 
^™ emphasizes the importance to use in vivo models to define a ftmction for specific 
!□ regulatory sequences (37, 38), 

1 2 Previous studies carried out in .transgenic mice to map transcriptional regulatory 
[SO elements responsible for intestinal expression have been performed using cis-acting 
sequences of genes expressed in villus associated-enterocytes of small intestine (4, 5, 
38-40). In some of these cases, precocious activation in the crypts in combination with 
extended expre^ion in the colon occurs in an inappropriate manner. Thus> to our 
knowledge, the 9kb regulatory region of the mouse villin gene represents title only 
25 characterized cis-acting sequences reported today that allow the expression of a 
heterologous gene in small intestine and colon epithelial cells of transgenic mice 
reproducing with great fideh'ty the tissue-specific and cell-specific pattern of expression 
when compared with that of the endogenous gene itselfl In addition, the mice lines that 
drive a transgenic expression exclusiv ly restricted to the intestinal mucosa could 
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already be studied after selection of those which will not display expression into the 
kidney becaaise of the positional effects. 

The ability to target genes of interest in transgenic mice following the villin resnicted- 
pattern of expression, and particularly in the ciypts stem cells enables to the 
development of taiseted genes in animal models. Experimental mouse models 
reproducing several steps of human colorectal carcinogenesis (a possible genetic 
pathway has been proposed by Fearon and Vogelstein (41)) could for instance be 
obtained by efBciently targeting the associated oncogenes or mutated tumor suppressor 
genes to colonocytes using the villin regulatory region. Another use could lie in the 
establishment of new cell lines derived fix>m the digestive tract by targeting a 
tbermosensitjve SV40 T antigen to the crypts resident-progenitors of intestinal cells, as 
used in other systems (42-44). 

Several of these applications are illustrated in the proposed protocols disclosed on 
Figure 8. 
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